Adapting Self-Organizing Maps to the MapReduce Programming Paradigm
نویسنده
چکیده
We present an adaption of the self organizing map (SOM) useful for cluster analysis of large quantities of data such as music classification or customer behavior analysis. The algorithm is based on the batch SOM formulation which has been successfully adopted to other parallel architectures and perfectly suits the map reduce programming paradigm, thus enabling the use of large cloud computing infrastructures such as Amazon EC2.
منابع مشابه
A GPU-accelerated algorithm for self-organizing maps in a distributed environment
In this paper we introduce a MapReduce-based implementation of self-organizing maps that performs compute-bound operations on distributed GPUs. The kernels are optimized to ensure coalesced memory access and effective use of shared memory. We have performed extensive tests of our algorithms on a cluster of eight nodes with two NVidia Tesla M2050 attached to each, and we achieve a 10x speedup fo...
متن کاملAn ANALYSIS on VARIATIONS of INPUT PATTERN DISTRIBUTIONS in SELF-ORGANIZING MAPS in 2D
Self-organizing mapping is an unsupervised learning paradigm used in pattern classification and hence artificial intelligence. This paradigm is based on modifying the class features via the incoming input stimuli. Its exciting part is that it introduces concepts such as neighborhood or mapping. Hence the results obtained from this paradigm highly depend on the selected neighborhood and mapping ...
متن کاملVisual mining in music collections with Emergent SOM
We describe different ways of organizing large collections of music with databionic mining techniques. The Emergent Self-Organizing Map is used to cluster and visualize similar artists and songs. The first method is the MusicMiner system that utilizes semantic descriptions learned from low level audio features for each song. The second method uses tags that have been assigned to songs and artis...
متن کاملGreen Product Consumers Segmentation Using Self-Organizing Maps in Iran
This study aims to segment the market based on demographical, psychological, and behavioral variables, and seeks to investigate their relationship with green consumer behavior. In this research, self-organizing maps are used to segment and to determine the features of green consumer behavior. This was a survey type of research study in which eight variables were selected from the demographical,...
متن کاملClassification of Streaming Fuzzy DEA Using Self-Organizing Map
The classification of fuzzy data is considered as the most challenging areas of data analysis and the complexity of the procedures has been obstacle to the development of new methods for fuzzy data analysis. However, there are significant advances in modeling systems in which fuzzy data are available in the field of mathematical programming. In order to exploit the results of the researches on ...
متن کامل